Sound Context Classification based on Joint Learning Model and Multi-Spectrogram Features
نویسندگان
چکیده
This article presents a deep learning framework applied for Acoustic Scene Classification (ASC), the task of classifying different environments from sounds they produce. To successfully develop framework, we firstly carry out comprehensive analysis spectrogram representation extracted sound scene input, then propose best multi-spectrogram combination front-end feature extraction. In terms back-end classification, novel joint model using parallel architecture Convolutional Neural Network (CNN) and Recurrent (C-RNN), which is able to learn efficiently both spatial features temporal sequences input. The experimental results have proved our proposed general robust ASC tasks by three main contributions. Firstly, most effective indicated specific datasets that none publication previously analyzed. Secondly, CNN C-RNN achieves better performance compared with only baseline in this paper. Finally, competitive state-of-the-art systems on various benchmark IEEE AASP Challenge Detection Scenes Events (DCASE) 2016 Task 1, 2017 2018 1A & 1B, LITIS Rouen.
منابع مشابه
Classification of Fricatives Using Novel Modulation Spectrogram Based Features
In this paper, we propose the use of a novel feature set, i.e., modulation spectrogram for fricative classification. Modulation spectrogram gives 2-dimensional (i.e., 2-D) feature vector for each phoneme. Higher Order Singular Value Decomposition (HOSVD) is used to reduce the size of large dimensional feature vector obtained by modulation spectrogram. These features are then used to classify th...
متن کاملmortality forecasting based on lee-carter model
over the past decades a number of approaches have been applied for forecasting mortality. in 1992, a new method for long-run forecast of the level and age pattern of mortality was published by lee and carter. this method was welcomed by many authors so it was extended through a wider class of generalized, parametric and nonlinear model. this model represents one of the most influential recent d...
15 صفحه اولSingle Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification
Separating multiple music sources from a single channel mixture is a challenging problem. We present a new approach to this problem based on non-negative matrix factorization (NMF) and note classification, assuming that the instruments used to play the sound signals are known a priori. The spectrogram of the mixture signal is first decomposed into building components (musical notes) using an NM...
متن کاملon the comparison of keyword and semantic-context methods of learning new vocabulary meaning
the rationale behind the present study is that particular learning strategies produce more effective results when applied together. the present study tried to investigate the efficiency of the semantic-context strategy alone with a technique called, keyword method. to clarify the point, the current study seeked to find answer to the following question: are the keyword and semantic-context metho...
15 صفحه اولBody Mass Index Classification based on Facial Features using Machine Learning Algorithms for utilizing in Telemedicine
Background and Objectives: Due to the impact of controlling BMI on life, BMI classification based on facial features can be used for developing Telemedicine systems and eliminating the limitations of measuring tools, especially for paralyzed people. So that physicians can help people online during the Covid-19 pandemic. Method: In this study, new features and some previous work features were e...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computing
سال: 2022
ISSN: ['2312-5381', '1727-6209']
DOI: https://doi.org/10.47839/ijc.21.2.2595